• Semi-CNN architecture for effective spatio-temporal learning in action recognition

      Leong, Mei Chee; Prasad, Dilip K.; Lee, Yong Tsui; Lin, Feng (Journal article; Peer reviewed, 2020-01-12)
      This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal features in video action recognition. Unlike 2D convolutional neural networks (CNNs), 3D CNNs can be applied directly to consecutive frames to extract spatio-temporal features. The aim of this work is to fuse the convolution layers from 2D and 3D CNNs to allow temporal encoding with fewer parameters ...
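
      The parameter argument in the abstract can be illustrated with a rough count. The sketch below is not the authors' architecture; it only compares the weights of one generic 2D convolution layer with one generic 3D convolution layer. The channel counts and kernel sizes are assumed values chosen for illustration, and the function names are hypothetical.

```python
# Illustrative only: parameter counts for a single convolution layer.
# Channel counts and kernel sizes are assumptions, not taken from the paper.

def conv2d_params(c_in, c_out, k, bias=True):
    """Weights of a 2D convolution with a k x k spatial kernel."""
    return c_out * (c_in * k * k + (1 if bias else 0))

def conv3d_params(c_in, c_out, k, t, bias=True):
    """Weights of a 3D convolution with a t x k x k spatio-temporal kernel."""
    return c_out * (c_in * t * k * k + (1 if bias else 0))

p2d = conv2d_params(c_in=64, c_out=64, k=3)
p3d = conv3d_params(c_in=64, c_out=64, k=3, t=3)

print(f"2D layer: {p2d} parameters")
print(f"3D layer: {p3d} parameters")
print(f"ratio 3D/2D: {p3d / p2d:.2f}")
```

      With these assumed sizes, the 3D layer carries roughly three times the weights of the 2D layer, because the kernel gains a temporal extent of 3. A network that uses 3D convolutions in only some of its layers would therefore need fewer parameters than a fully 3D one.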